Legacy language atlas data mining: mapping Kru languages
نویسنده
چکیده
An online tool based on dialectometric methods, DistGraph, is applied to a group of Kru languages of Côte d’Ivoire, Liberia and Burkina Faso. The inputs to this resource consist of tables of languages x linguistic features (e.g. phonological, lexical or grammatical), and statistical and graphical outputs are generated which show similarities and differences between the languages in terms of the features as virtual distances. In the present contribution, attention is focussed on the consonant systems of the languages, a traditional starting point for language comparison. The data are harvested from a legacy language data resource based on fieldwork in the 1970s and 1980s, a language atlas of the Kru languages. The method on which the online tool is based extends beyond documentation of individual languages to the documentation of language groups, and supports difference-based prioritisation in education programmes, decisions on language policy and documentation and conservation funding, as well as research on language typology and heritage documentation of history and migration.
منابع مشابه
ATLaS: A Native Extension of SQL for Data Mining
A lack of power and extensibility in their query languages has seriously limited the generality of DBMSs and hampered their ability to support data mining applications. Thus, there is a pressing need for more general mechanisms for extending DBMSs to support efficiently database-centric data mining appliacations. To satisfy this need, we propose a new extensibility mechanism for SQL-compliant D...
متن کاملClassification of Guébie within Kru
Guébie, a Kru language spoken in Côte d’Ivoire, is currently doubly classified within Eastern Kru according to Ethnologue (Lewis et al. 2013). It is listed as a dialect of two distinct subgroups, Bété and Dida. This double classification is clearly problematic, and this paper provides the initial work towards addressing the correct classification of the language. Here I compare the phonological...
متن کاملATLAS: A Small but Complete SQL Extension for Data Mining and Data Streams
DBMSs have long suffered from SQL’s lack of power and extensibility. We have implemented ATLaS [1], a powerful database language and system that enables users to develop complete data-intensive applications in SQL—by writing new aggregates and table functions in SQL, rather than in procedural languages as in current Object-Relational systems. As a result, ATLaS’ SQL is Turing-complete [7], and ...
متن کاملATLaS: A Native Extension of SQL for Data Mining and Stream Computations
A lack of power and extensibility in their query languages has seriously limited the generality of DBMSs and hampered their ability to support new application domains. Considerable efforts by database researchers and commercial DBMS vendors have led to major extensions; yet there remain important applications—particularly data mining—that are not supported well in SQL-3. Thus, there is a pressi...
متن کاملApplying Cognitive Principles of Similarity to Data Integration - The Case of SIAM
Increasingly, modern system design is concerned with the integration of legacy systems and data. Consequently, data integration is an important step in many system design projects and also a prerequisite to data warehousing, data mining, and analytics. The central step in data integration is the identification of similar elements in multiple data sources. In this paper, we describe an applicati...
متن کامل